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Abstract. CVcat is a database that contains published data on cataclysmic variables and related objects. Unlike in 
the existing online sources, the users are allowed to add data to the catalogue. The concept of an "open catalogue" 
approach is reviewed together with the experience from one year of public usage of CVcat. New concepts to be 
included in the upcoming AstroCat framework and the next CVcat implementation are presented. CVcat can be 
found at http : //www . cvcat . org. 
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CVcat is an interactive database or "online catalogue" 
that offers a number of features so far unknown to scien- 
tific catalogues. It was developed as a tool for the research 
community working on cataclysmic variables (CVs), a 
class of close interacting binaries, and as a case study for 
some of the concepts to be used in the development of a 
general catalogue software, AstroCat. CVcat can be ac- 
cessed online at http : //www . cvcat . org. It was first pre- 
sented to the public in A ugust 2001 durin g a CV con- 
ference held in Gottingen llKube et al.l l2002h Since then, 
the number of users of CVcat has increased to more than 
one hundred, the daily average of requests is around fifty 
(total number of delivered pages). 



2. The concept 

CVcat has been developed in order to overcome ma- 
jor conceptual shortc omings of existing CV catalogues: 
IRitter fc Kolbl ( 1998) include only systems with known 
orbital period, which limit s their catalogue to ~ 1 /3 of all 
CVs and related objects. iDownes et a l. (1997) do list all 
known CVs, but their catalogue provides only very lim- 
ited information on each individual system, i.e., the only 
binary parameter included is the orbital period. Our aim 
was to develop an online data base that combines the in- 
formation of the existing catalogues and allows the users 
to actively contribute to the content of the data base, im- 
plementing a first version of an "open catalogue" . 



* Present affiliation: Alfred-Wegener-Institute for Polar- 
and Marine Research, Telegrafenberg A43, D-144^3 Potsdam, 
Germany; Koldewey-Station, N-9173 Ny-Alesund, Norway 



CVcat differs from other CV catalogues and other as- 
tronomical databases in the concept of the data input. 
So far, the majority of astronomical catalogues have been 
compiled by relatively small editorial teams consisting 
of scientists kn owledgeable in the fields covered by the 
catalogues (e.g. Dowries et al.lll997t IRitter fc Kolblll99St 
iMcCook fc Sionlll999t iLiu et alJl200lh . These catalogues 
typically contain more or less detailed information on a 
specific class of astronomical objects. Updates are pub- 
lished, if at all, only on a very irregular basis. The cata- 
logues contain just one value for each listed property (e.g. 
distance, orbital period) of a given object. While this is 
helpful for non-specialist users to obtain a quick overview 
of the properties of an individual object, or of the statisti- 
cal properties of a given group of objects, the more expert 
user will certainly benefit if different and possibly com- 
peting values for a given parameter are referenced in such 
catalogues. This is particularly useful if the information 
has been obtained by different methods. 

Some of the aforementioned catalogues moved from 
"classical" printed publication to online web-based publi- 
cation, which allow s shorter upda t e cycl es [e.g. the "liv- 
ing edition" of the IDownes et al.l l|l997j) CV catalogue, 
IDownes et al.l l|200lh ]. however, the overall concepts re- 
mained unchanged. In addition to these specialized cat- 
alogues for a specific object class there e xists huge data 
bases like SIMBAD ijWenger et al.ll2000j) . which provide 
very basic properties for an enormously large number of 
objects. However, due to the very global coverage of as- 
tronomical objects, data contained in SIMBAD are prone 
to be incomplete and/or inaccurate. 

The concept of an "open catalogue" implemented in 
CVcat permits every registered user to add data to the 
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(a) Classic 
Editor 



(b) current CVcat (c) AstroCat/CVcat 

Editor Editor 



add new data read data 




remove or approve data 

change data 



Fig. 1. Classical, current, and future concepts for adding 
and editing catalogue data in comparison. 



catalogue, which is instantly visible to all other users. 
The quality control is performed by an editorial team 
(Sect. 2.1), which may alter or remove erroneous data. For 
every property of an object, an arbitrary number of val- 
ues can be stored (e.g. several published values for the 
distance). CVcat returns one of these values as the "best 
available" value, selected as such by the editors. However, 
as such a selection process often involves some subtle sub- 
jective view of the editor, the more expert user may decide 
to inspect the original sources for the competing values, 
and, thereafter, decide based on his/her experience which 
value is best suited for a given purpose. 

A sketch of the different concepts of data input and 
validation is given in Fig. ^ (a) for classical catalogues 
and (b) for the CVcat concept, (c) introduces the concept 
that will be implemented in the next release of CVcat, 
which is adressed in Sect.0] In the future version, newly- 
added data will be instantly visible as it is now [Fig. ^ 
(b)], but will be tagged as "unapproved" until an editor 
cross-checks the data. In the current implementation (b), 
the user cannot see if a database entry has been approved 
by an editor. 

2.1. Distributed editorial team 

The editorial team of CVcat is recruited from the CV re- 
search community, and consists (ideally) of one expert per 
CV subclass. Since these editors are typically familiar with 
the publications on their "favourite" objects anyway, the 
amount of work to cross-check newly entered data is low- 
ered. We estimate that for a typical subclass of CVs, say, 
polars, the time to be spent on editorial duties in CVcat 
is of the order of two hours per week or less. Each editor 
has the privilege to remove or change erroneous data for 
that person's object class only. For data on all other ob- 
ject classes, the editor is a non-privileged user who may 
add new data and browse the catalogue. Note that every 



user may add data, in contrast to, e.g., SIMBAD, where 
only the editorial team can directly modify the catalogue 
content. 

2.2. Database content 

In contrast to the existing CV catalogues, CVcat is de- 
signed to contain a great variety of information for each 
object. Currently, the following object properties can be 
included, with the number in brackets being the number of 
entries on September 17, 2002 (note that because CVcat 
allows multiple entries for a given property of an object 
these numbers do not represent the number of unique ob- 
jects for which a particular property is known): 

High state magnitude (922), low state magnitude 
(794), optical spectrum exists (665), orbital period (509), 
distance (221), general magnitude (209), primary mass 
(156), inclination (153), secondary mass (142), superhump 
period (128), secondary spectral type (121), mass ratio 
(93), Dopplcr tomogram exists (80), primary temperature 
(76), optical light curve exists (61), hydrogen column den- 
sity (58), orbital ephemeris (55), eclipsing (48), spin period 
(36), primary radius (33), secondary radius (29), uv data 
exists (22), general magnetic field strength of primary star 
(18), field strength of primary magnetic pole (17), x-ray 
data exists (14), eclipse map exists (11), secondary tem- 
perature (8), colatitude of primary magnetic pole (5), field 
strength of secondary magnetic pole (3), spin ephemeris 
(3), azimuth of primary magnetic pole (2), magnitude in 
eclipse (2), azimuth of secondary magnetic pole (1). 

Every value stored in CVcat is linked to its origi- 
nal publication, either in the NASA ADS using the ADS 
bibcode of the paper l|Kurtz et all 12000). to the astro- 
ph /arXiv e-print ar chive, or to the VSNET messages 
l|Nogami et all 11997]) . Besides references to the publica- 
tions from which the data entries contained in CVcat are 
taken, a list of articles with general information on a given 
object can be stored in CVcat. 

We allow inclusion of data from astro-ph, which is 
not necessarily identical to the data published in the fi- 
nal refereed version (or which may in some cases never 
make it through the refereeing stage). Most of the astro- 
ph data, however, is promoted to refereed information at 
some point. It is a typical task of the editors to track such 
updates and to conduct the appropriate changes to the 
database, i.e. changing the source from an astro-ph to an 
ADS bibcode. 



2.3. Searching the database 

Data retrieval from CVcat works in two ways: (i) the user 
obtains all available data for a specific object, which can 
be found using its object type, its coordinates, or one of 
its names, (ii) the user creates a table containing selected 
properties for a list of objects. The latter method allows 
the user to create data tables suitable for easy graphics 
generation as well as ready-to-publish M^X-tables. 
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Fig. 2. Search form in CVcat: A large fraction of the 
search form focuses on the detailed selection of the ob- 
ject class. 



SS Cyg 

Add new data * Add new literature 


edit | 






Alternate Name: BD+42 4189a 








Object classification: (no reference) 

• cataclysmic variable 

• SS Cyg star (U Gem variable of SS Cyg subtype) 

• dwarf nova 






Right Ascension (2000.0000) 


21 42 42.66 


1997PASP..109..345D 




Declination (2000.0000) 


+43 35 09.5 


1997PASP..109..345D 




Galactic longitude 


90.559 


Calculated from coordinates 




Galactic latitude 


-7,110 




Orbital Period [min] 


396.187 


1 990MNRAS.246..B54F 
2001 -Nov-1 4, TPP-Group (bh) 


edit | 


High state magnitude [v] 


8.2 


1 997PASP..109..345D 


edit | 


2001 -Jan-19, TPP-Group (bo) 


Low state magnitude [v] 


12.1 


1 997PASP..109..345D 


edit | 


2001 -Jan-19, TPP-Group (bo) 


Inclination [deg] 


37+5 


1 998A&A5..129 ..83R 


edit | 


2000-Jun-9, TPP-Group (bo) 


Primary Mass [Solar Mass] 


1.19±0.02 


1 990MNRA5.246 B54F 


edit | 


2001 -Nov-1 4, TPP-Group (bh) 


Secondary Mass [Solar Mass] 


0.704±0.002 


1 990MMRAS.248 854F 
2001 -Nov-1 4, TPP-Group (bh) 


edit | 


Mass Ratio [M1/M2] 


1.69±0.06 


1 990MMRAS 248 354F 


edit | 


2001 -Nov-I A, TPP-Group (bh) 


Secondary Spectral Type [MK-Type] 


K4V 


1984ApJ...286 747H 


edit | 


2001 -Nov-I A, TPP-Group (bh) 


Secondary Spectral Type [MK-Type] 


K4 


2000AJ....120.2649H 


edit | 
... i 


2000-NOV-21 , TPP-Group (bh) 
1 qqqfln I ^1 ri 93U 



Fig. 3. Search result in CVcat: 
for SS Cygni 



Some of the data stored 



Searching in CVcat is organized as a two-step process. 
In the first step, the user can enter the search pattern, 
which can be the name or a substring of the name, a set 
of object classes, and the coordinate range of the object 
(Fig. I2J . The object class is selected using a grid of logical 
operators ("and", "or", and "not"). It is possible, e.g., to 
look for dwarf novae, which have also been observed as no- 
vae: choose "and DN" and "and Nova" . Another example 
is to look for nova- likes which do not show the SW Sex phe- 
nomenon: the corresponding selection would be "or NL" 
and "not SW" . 

After submitting this search request a list with all 
objects matching the search criteria is returned (not 
shown). In this list, all object names arc hyperlinks to the 



page showing all results for the specific object (Fig. EJJ. 
Alternatively, a set of objects can be chosen from this list 
to generate a user-configurable list with certain properties 
of the objects. This list may be adjusted in a way that ob- 
jects without a published value for a certain property are 
not included. By iterative calls of the list generator it is 
possible to distill a table containing e.g. all objects with 
known masses and periods. 

An example of the results of such a list creation process 
is given in the following section. 

2.4. Example: Orbital periods and donor masses 

Using the data contained in the CVcat database, we have 
plotted the secondary star masses as a function of the 
orbital period, Fig. for all CVs. A linear trend with 
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0.1LP/h-0.06 



(1) 



is clearly visible. This is predicted by theoreti cal consider- 
ation s, where the reasoning is roughly this l)Frank et alJ 
1992): Using an approximation for the Roche geome- 
try, valid for 1.3 ^ q ^ 10, and Kepler's law, one 
finds that the mean density of a Roche lobe filling 
star is a function of only the orbital period. With the 
knowledge of the lower main-sequence M/ R re lation 
jKippenhahn fc Weigertl ll99Cl iBaraffe et allll998(l . one 
then finds 



Mi 

M 



O.llP/h 



(2) 



A more detailed analysis of secondary star 
masses leads to sligh tly different period-mass-relations 
llSmith Dhi lloTii ri99^ : 



M 2 
M Q 
M 2 
M 



(0.038 ±0.003)(P/h)( 1 - 58±0 09 ) or 



(0.126 ± 0.011)P/h - (0.11 ± 0.04) 



(3) 
(4) 



The data currently available in CVcat are consistent with 
the results from Smith & Dhillon (Fig.^J). Note that some 
of the published secondary star masses may be derived 
from the orbital period using some theoretical models, 
hence artificially stabilizing the fit close to the theoret- 
ical predictions. 

3. Usage statistics 

The log file analysis of CVcat usage over a one year pe- 
riod shows that CVcat has a stable user community which 
is still slowly increasing. The typical usage of CVcat is 
to request all available data on a specific object, which 
is normally queried by its name. List generation of ob- 
jects selected by their class ranks second in the usage of 
the database. The current growth rate of the catalogue 
is around 50 entries per month, mostly from the CVcat 
editors. 
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Fig. 4. Secondary mass plotted against orbital period 
(points) and period histogram (gray bars) for all CVs 
with known orbital period and secondary mass. The full 
line shows the relation from Eq. QJ, the short dashed line 
from Eq. © , the long dashed line from Eq. J3J . Full boxes 
and error bars are used for systems with estimated errors 
in M%, open boxes for secondary star masses with an un- 
known error range. The given (M2, P) data pairs represent 
w 10% of all known CVs, the histogram contains « 39% 
of all CVs known to CVcat. 

The data flow into CVcat originating from outside the 
CVcat core team is not yet satisfying. It is unclear why 
most users refrain from adding data from their own pub- 
lications. Users should bear in mind that the probability 
of having their papers cited increases if the information 
from their publications can be found in the database. 

4. Prospects 

The concept of CVcat has demonstrated the benefits of a 
public scientific catalogue with a globally-shared expertise 
of its users and editors. The general structure of CVcat is 
also applicable to other fields of astronomy. Hence, a more 
general software based on the experiences with CVcat 
which will allow the implemenation of catalogues for ar- 
bitrary object classes, called "AstroCat" , is currently be- 
ing developed. This software will also include additional 
features to improve the knowledge management of astro- 
nomical results and data: 

— Besides the "single number data" stored so far, the in- 
frastructure for storing more complex data products, 
e.g., light curves, spectra, and finding charts will be 
added. This feature should help to overcome a major 
shortcoming in the present method of scientific pub- 
lishing: most authors publish their reduced data only 
in the form of plots, making follow-up work on these re- 
sults rather unattractive. While these data may be ob- 
tained directly from the authors, experience shows that 
in many cases the data have been lost due to faulty 
hardware or storage media, with the probability of loss 



increasing dramatically with time since the publication 
of the original work. In addition, the authors may have 
left astronomy, preventing access to their data. If the 
data is available in CVcat, the re-examination of ob- 
servations would be much easier and independent of 
contact with the original author and the correspond- 
ing data archive. The storage of reduced and published 
data in a usable form at e.g. CDS, Strasbourg, is al- 
ready promoted by journals like A&A. 

— Information agents will allow users to be informed au- 
tomatically if new data is entered into the database for 
their objects of interest. This is a service that keeps 
the user informed about new publications on a given 
object or class of objects without having to log in to 
CVcat frequently. 

— An elaborated validation system will allow the users 
to add their comments to published data. This unique 
feature will add personal communication aspects to the 
database. Public discussion on details of the published 
data may arise from this. 

— To allow exact quoting of a specific state of the ever- 
changing catalogue, a "versioning method using a 
global identification number (GID) will be used: the 
GID is incremented with each change of the content of 
AstroCat (e.g. adding new data, marking data as cor- 
rect, marking data as the best available etc.). Every 
output resulting from the use of AstroCat will in- 
clude the current GID number. An older state of the 
database can be exactly reproduced by issuing a given 
GID (lower than the current one), and using AstroCat 
with such a specific GID will return always precisely 
the same results, independent of the actual state of the 
data base. Any statistical analysis based on data from 
CVcat should therefore include the GID of the CVcat 
state at the time of analysis, permitting a quantitative 
comparison with future studies. It is not necessary to 
rely on a fixed or frozen state so far only available in 
printed catalogues. 

— Data that have not yet been cross-checked by the edi- 
torial team will be tagged as such. This method com- 
bines both the unique speed of the CVcat concept and 
the expertise of the editorial team. As long as a new 
datum is visible but marked as unapproved, the user of 
the database can use the new entry under his own re- 
sponsibility, while an approved value is authoritative. 

— Hyperlinks to a large number of available web re- 
sources will be included for every object. This includes, 
e.g., links to other CV cataloges like SIMBAD, the 
ADS, arXiv.org, the different variable star observers 
archives, and many more. 

The development of the new CVcat and the 
AstroCat software is done in close cooperation with 
the users of the current implementation. In summer 
2003, the data from the current CVcat will be trans- 
ferred to the new database. By the end of 2003, the 
AstroCat /CVcat framework will be completed. Ideas 
from the editors and users of other interactive cata- 
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logues, e.g. the high-z database llde Mello et all l2002lh 
are highly appreciated. The AstroCat project is hosted 
at http : //astrocat . vmi-goettingen. de. 

5. Technical realization 

CVcat is implemented as a Perl script which runs on a 
Linux PC. This Perl script processes the user requests 
(HTTP GET/POST requests) and creates HTML pages 
that are delivered via the Apache web server. Hence, 
CVcat presents itself as an interactive web page. 

The data is stored in a MySQL data base to which 
the Perl script communicates using the standard Perl 
DBI/DBD interface. 

For the implementation of AstroCat, an XML layer 
will be included between the data base and the HTML 
layer. This XML layer can be used to automatically in- 
clude larger data sets into the database. Another possible 
application will be to use the AstroCat framework as an 
archive for reduced dat a from robotic telescopes that use 
RTML l|Hessmanll200l|) for their observation requests, ac- 
complished observations, and some of the metadata. A 
technical advantage of the usage of XML is the easy avail- 
ability of many very good tools specialized in the process- 
ing of XML data and the transformation of XML docu- 
ments into HTML pages. This will improve the quality and 
the speed of the implementation. PHP might be used in 
addition to Perl as the scripting language for the AstroCat 
programs. 

6. Summary 

With CVcat we have implemented an online catalogue for 
cataclysmic variable stars, which - for the first time - al- 
lows its users to add data instantly visible to all other 
users. The quality control is realized by a team of ex- 
perts who share the responsibility for the different object 
classes. From the experiences of one year of public usage 
of the catalogue we have compiled a number of new con- 
cepts which will be implemented in the next generation 
of CVcat in the context of a more general framework for 
astronomical databases, AstroCat. 
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